Across-Document Neighborhood Expansion: UMass at TAC KBP 2012 Entity Linking
نویسندگان
چکیده
Last year’s competition demonstrated that the NER context contains important information that should not be ignored in entity linking. State-of-the-art approaches anchor on unambiguous entities, look for overlap in categories, or approximate a joint model of candidate assignments, after Wikipedia candidates have been selected. Current candidate approaches, such as anchor text maps, are effective but may lead to very large candidate sets to be examined. UMass has two objectives for our TAC submission. First, we use cross-document context information to perform entity neighborhood expansion and estimate the importance of entity context using corpus-wide information. Second, we use probabilistic information retrieval that incorporates the neighborhood information to generate a ranked candidate set in a single step. The result is a small candidate set that even for less than 50 candidates contains the true answer in 95% of the cases, allowing for computationally intensive inference in the next phase. It turns out that our best performing run simply predicts the top candidate of the unsupervised candidate ranking, outperforming more than half of the contestants.
منابع مشابه
UMass CIIR at TAC KBP 2013 Entity Linking: Query Expansion using Urban Dictionary
This paper describes the system submitted to the TAC 2013 entity linking task of the Knowledge Base Population track. The core of the approach is probabilistic information retrieval over a search index of the knowledge base, including the text of Wikipedia. The retrieval results are further reranked using a supervised learning-to-rank model. The submission this year builds on the neighborhood a...
متن کاملThe TALP participation at TAC-KBP 2013
This document describes the work performed by the Universitat Politècnica de Catalunya (UPC) in its second participation at TAC-KBP 2013 in both the Entity Linking and the Slot Filling tasks.
متن کاملLIA at TAC KBP 2012 English Entity Linking track
This paper describes our participation in the English Entity Linking task at KBP 2012.
متن کاملContext-Based Entity Linking - University of Amsterdam at TAC 2012
This paper describes our approach to the 2012 Text Analysis Conference (TAC) Knowledge Base Population (KBP) entity linking track. For this task, we turn to a state-of-the-art system for entity linking in microblog posts. Compared to the little context microblog posts provide, the documents in the TAC KBP track provide context of greater length and of a less noisy nature. In this paper, we adap...
متن کاملPRIS at TAC2012 KBP Track
Our method to Knowledge Base Population at TAC2012 is described in this paper. An enhanced pattern bootstrapping system is mainly utilized in the Slot Filling task. And for the Entity Linking task, query expansion method, rule-based method and entity similarity ranking strategy are combined.
متن کامل